Learning to Generate Compositional Color Descriptions
Authors
Abstract
The production of color language is essential for grounded language generation. Color descriptions have many challenging properties: they can be vague, compositionally complex, and denotationally rich. We present an effective approach to generating color descriptions using recurrent neural networks and a Fourier-transformed color representation. Our model outperforms previous work on a conditional language modeling task over a large corpus of naturalistic color descriptions. In addition, probing the model’s output reveals that it can accurately produce not only basic color terms but also descriptors with non-convex denotations (“greenish”), bare modifiers (“bright”, “dull”), and compositional phrases (“faded teal”) not seen in training.
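The abstract names a Fourier-transformed color representation without giving its details. The following is a minimal sketch of one way such a representation could be computed. It assumes the color is given in HSV with each component scaled to [0, 1], and that three integer frequencies per axis are used; the function name `fourier_color_features` and the parameter `max_freq` are hypothetical and are not taken from the text above.

```python
import cmath
from itertools import product


def fourier_color_features(h, s, v, max_freq=3):
    """Map an HSV color (components in [0, 1]) to real-valued Fourier features.

    Assumption: frequencies j, k, l each range over 0..max_freq-1, and every
    complex coefficient exp(-2*pi*i*(j*h + k*s + l*v)) contributes its real
    and imaginary parts to the feature vector.
    """
    features = []
    for j, k, l in product(range(max_freq), repeat=3):
        coeff = cmath.exp(-2j * cmath.pi * (j * h + k * s + l * v))
        features.append(coeff.real)
        features.append(coeff.imag)
    return features


# Example: a mid-saturation color; with max_freq=3 there are 3**3 = 27
# complex coefficients, so 54 real-valued features.
features = fourier_color_features(0.5, 0.8, 0.6)
```

Because the frequencies are integers, a hue of 0 and a hue of 1 map to the same features, which matches the circular nature of hue. A vector like this could then serve as the conditioning input to a recurrent language model over description tokens.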
Similar resources
Learning to Compose Spatial Relations with Grounded Neural Language Models
Language is compositional: we can generate and interpret novel sentences by having a notion of meaning of their individual parts. Spatial descriptions are grounded in perceptional representations but their meaning is also defined by what neighbouring words they co-occur with. In this paper we examine how language models conditioned on perceptual features can capture the semantics of composed ph...
Effect of CO2 Concentration in Injecting Gas on Minimum Miscibility Pressure: Compositional Model and Experimental Study
For technical and economic success of miscible gas injection projects, an accurate laboratory measurement of Minimum Miscibility Pressure (MMP) at reservoir conditions is essential. On the other hand, compositional reservoir simulator is a useful tool in gas injection studies and prediction of MMP. The main goal of this paper is to describe a procedure to generate a three phase sequential t...
Text2Shape: Generating Shapes from Natural Language by Learning Joint Embeddings
We present a method for generating colored 3D shapes from natural language. To this end, we first learn joint embeddings of freeform text descriptions and colored 3D shapes. Our model combines and extends learning by association and metric learning approaches to learn implicit cross-modal connections, and produces a joint representation that captures the many-to-many relations between language ...
Modified CLPSO-based Fuzzy Classification System: Color Image Segmentation
Fuzzy segmentation is an effective way of segmenting out objects in images containing both random noise and varying illumination. In this paper, a modified method based on the Comprehensive Learning Particle Swarm Optimization (CLPSO) is proposed for pixel classification in HSI color space by selecting a fuzzy classification system with minimum number of fuzzy rules and minimum number of incorr...
A Social Semiotic Analysis of Social Actors in English-Learning Software Applications
This study drew upon Kress and Van Leeuwen’s (2006, [1996]) visual grammar and Van Leeuwen’s (2008) social semiotic model to interrogate ways through which social actors of different races are visually and textually represented in four award-winning English-learning software packages. The analysis was based on narrative actional/reactional processes at the ideational level; mood, perspective, ...